Planning and Learning with Adaptive Lookahead

Abstract

Some of the most powerful reinforcement learning frameworks use planning for action selection. Interestingly, their planning horizon is either fixed or determined arbitrarily by the state visitation history. Here, we expand beyond the naive fixed horizon and propose a theoretically justified strategy for adaptive selection of the planning horizon as a function of the state-dependent value estimate. We propose two variants for lookahead selection and analyze the trade-off between iteration count and computational complexity per iteration. We then devise a corresponding deep Q-network algorithm with an adaptive tree search horizon. We separate the value estimation per depth to compensate for the off-policy discrepancy between depths. Lastly, we demonstrate the efficacy of our adaptive lookahead method in a maze environment and Atari.

Similar resources

Information-Lookahead Planning for AUV Mapping

Exploration for robotic mapping is typically handled using greedy entropy reduction. Here we show how to apply information lookahead planning to a challenging instance of this problem in which an Autonomous Underwater Vehicle (AUV) maps hydrothermal vents. Given a simulation of vent behaviour we derive an observation function to turn the planning for mapping problem into a POMDP. We test a vari...

A Lookahead Strategy for Heuristic Search Planning

Relaxed plans are used in the heuristic search planner FF for computing a numerical heuristic and extracting helpful actions. We present a novel way for extracting information from the relaxed plan and for dealing with helpful actions, by considering the high quality of the relaxed plans in numerous domains. For each evaluated state, we employ actions from these plans in order to find the begin...

Machine Learning for Adaptive Planning

This chapter is concerned with the enhancement of planning systems using techniques from Machine Learning in order to automatically configure their planning parameters according to the morphology of the problem in hand. It presents two different adaptive systems that set the planning parameters of a highly adjustable planner based on measurable characteristics of the problem instance. The plann...

Learning Rules for Adaptive Planning

This paper presents a novel idea, which combines Planning, Machine Learning and Knowledge-Based techniques. It is concerned with the development of an adaptive planning system that can fine-tune its planning parameters based on the values of specific measurable characteristics of the given planning problem. Adaptation is guided by a rule-based system, whose knowledge has been acquired through m...

A neuromorphic model of spatial lookahead planning

In order to create spatial plans in a complex and changing world, organisms need to rapidly adapt to novel configurations of obstacles that impede simple routes to goal acquisition. Some animals can mentally create successful multistep spatial plans in new visuo-spatial layouts that preclude direct, one-segment routes to goal acquisition. Lookahead multistep plans can, moreover, be fully develo...

Journal

Journal title: Proceedings of the ... AAAI Conference on Artificial Intelligence

Year: 2023

ISSN: 2159-5399, 2374-3468

DOI: https://doi.org/10.1609/aaai.v37i8.26149